Assamese Text to Speech Corpus

0 reviews requests (1)

Owner Central Institute of Indian Languages

Catalogue Number: 1514

Stock In Stock

OverView

Assamese Text to Speech Corpus 44:49:34 hours | 28.85 GB | 32,594 Audio Segments | 2 Speakers The LDC-IL Assamese Text to Speech dataset comprises audio files in wav format, accompanied by a corresponding textual...

Please Login to see the price

Tags: Assamese; Text; Speech; Corpus

Categories Cart Account Search Recent View Go to Top

Dataset Description

Assamese Text to Speech Corpus

44:49:34 hours | 28.85 GB | 32,594 Audio Segments | 2 Speakers

The LDC-IL Assamese Text to Speech dataset comprises audio files in wav format, accompanied by a corresponding textual layer in Assamese script. This dataset spans a duration of 44:49:34 (hh:mm:ss) , consisting of read speech in the studio setup. The data is derived from 01 female and 01 male native Assamese speakers. A comprehensive explanation of dataset can be found in the Assamese Text to Speech Documentation.

For any research-based citations, please use the following citations:

Syeda Mustafiza Tamim, Prangshu Manjul, Stephen Fernandes, Nithin S., Roopashri M. R., Dr. Narayan Kumar Choudhary, Prof. Shailendra Mohan. 2025. Assamese Text to Speech Corpus. Central Institute of Indian Languages, Mysore. 978-93-48633-45-3.
Rejitha K. S. and Narayan Kumar Choudhary. (ed.). 2025. LDC-IL Corpus Insights. Central Institute of Indian Languages, Mysore. 978-93-48633-33-0.

Item specifics

Authors Syeda Mustafiza Tamim, Prangshu Manjul, Stephen Fernandes, Nithin S., Roopashri M. R., Dr. Narayan Kumar Choudhary, Prof. Shailendra Mohan.
Corpus Type Text to Speech Corpus
Catalogue Number 1514
ISBN 978-93-48633-45-3
Data Source On Field
Duration 44:49:34 hours
# of Audio Segments 32594
Release Date 3/20/2025
Terms and Conditions General instructions for use of the resources provided by LDC-IL.

Assamese Text to Speech Corpus

OverView

Assamese Text to Speech Corpus

Dataset Description

Item specifics

Write a review